SemanticScuttle - klotz.me » klotz: deep learning+huggingface

klotz: deep learning* + huggingface*

From Zero to Reasoning Hero: How DeepSeek-R1 Leverages Reinforcement Learning to Master Complex Reasoning

The article explores the DeepSeek-R1 models, focusing on how reinforcement learning (RL) is used to develop advanced reasoning capabilities in AI. It discusses the DeepSeek-R1-Zero model, which learns reasoning without supervised fine-tuning, and the DeepSeek-R1 model, which combines RL with a small amount of supervised data for improved performance. The article highlights the use of distillation to transfer reasoning patterns to smaller models and addresses challenges and future directions in RL for AI.

2025-02-06 Tags: deepseek-r1, reinforcement learning, distillation, llm, huggingface, machine learning by klotz

AI Weekly: Researchers attempt an open source alternative to GitHub's Copilot | VentureBeat

2021-09-25 Tags: transformers, fine-tune, deep learning, gpt-3, huggingface, codex by klotz

samrawal/emacs-secondmate: An open-source, mini imitation of GitHub Copilot for Emacs.

2021-07-28 Tags: eleutherai, ai, emacs, gpt-2, gpt-3, huggingface, code, autocomplete, how-do-i, machine learning, deep learning by klotz

zero-shot-pipeline-sentiment.ipynb - Colaboratory

2020-08-28 Tags: colab, google, notebook, huggingface, transformers, gpt-2, nlp, deep learning, classification, text, zero-shot by klotz

Zero-Shot Text Classification with Hugging Face | by Andrej Baranovskij | Aug, 2020 | Towards Data Science

This post is about detecting text sentiment in an unsupervised way, using a Hugging Face zero-shot text classification model.

2020-08-28 Tags: text, classification, huggingface, nlp, deep learning by klotz

Hugging Face: State-of-the-Art Natural Language Processing in ten lines of TensorFlow 2.0

2019-11-01 Tags: deep learning, nlp, huggingface, bert, roberta, gpt-2, distilbert, tensorflow by klotz
